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ABSTRACT 



Aims. We present a comparison of two methods of fitting solar-like variability to increase the efficiency of detection of Earth-like planetary 
transits across the disk of a Sun-like star. One of them is the harmonic fitting method that coupled with the BLS detection algorithm demon- 
strated the best performance during the first CoRoT blind test. 

Methods. We apply a Monte Carlo approach by simulating a large number of light curves of duration 150 days for different values of planetary 
radius, orbital period, epoch of the first transit, and standard deviation of the photon shot noise. Stellar variability is assumed in all the cases to 
be given by the Total Solar Irradiance variations as observed close to the maximum of solar cycle 23. After fitting solar variability, transits are 
searched for by means of the BLS algorithm. 

Results. We find that a model based on three point-like active regions is better suited than a best fit with a linear combination of 200 harmonic 
functions to reduce the impact of stellar microvariability provided that the standard deviation of the noise is 2 - 4 times larger than the central 
depth of the transits. On the other hand, the 200-harmonic fit is better when the standard deviation of the noise is comparable to the transit 
depth. 

Conclusions. Our results show the advantage of a model including a simple but physically motivated treatment of stellar microvariability for 
the detection of planetary transits when the standard deviation of the photon shot noise is greater than the transit depth and stellar variability is 
analogous to solar irradiance variations. 

Key words, planetary systems - methods: data analysis - techniques: photometric - stars: activity - stars: late-type 



1. Introduction 



Since the pioneering work by iMavor & Oueloa d 1 995b . about 
270 extrasolar planets have been discovered, mostly through 
the periodic oscillations of the radial velocity of their parent 
star^|. About 35 show transits across the disk of their stars al- 
lowing us to remove degeneracy between orbital inclination 
and amplitude of the radial velocity curve, thus providing in- 
formation on planet's mass and radius. Transit searches from 
the ground are limited to giant planets because their photomet- 
ric precision cannot be pushed beyond the millimagnitude limit 
due to the scintillation of the Earth atmosphere. This limitation 
can be overcome by space-borne telescopes. The space mis- 
sions CoRoT and Kepler have been specifically designed for 
searching planetary transits and can reach the photometric pre- 
cision to detect transits of terrestrial planets in front of late- 
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type main-sequenc e stars ([Baglin 2003; M outou et al 
Borucki et ai]|2004l) . 



2005 



The main source of noise in the detection of terrestrial 
planet transits at the level of precision of space-borne photom- 
etry is the intrinsic variability of the parent star. For late-type 
stars having an outer convection zone, magnetic activity is the 
main source of variability and can severely limit the capabil- 
ity of detecting transits even in relatively inactive stars. In the 
case of the Sun, the transit of a large sunspot group across the 
disk produces a dip of the solar irradiance of 2000 - 2500 parts 
per million (ppm), whereas the central transit o f an earth-size d 
planet produces a flux decrease of only 84 ppm dJenkinsl2002l) . 
During phases of minimum activity, even when no sizeable spot 
group is detected, the solar flux is modulated by faculae, with 
amplitudes usually exceeding 100-200 ppm. 

In principle, the extraction of the transit signal from a 
light curve contaminated by stellar microvariability is possible 
thanks to the different timescales characterizing transit dura- 
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tion and stellar variability. Specifically, the former ranges from 
a few hours to about one day, in the case of terrestrial planets 
with orbital periods up to about one year around a solar-like 
star, while the latter is dominated by the rotational modulation 
of the largest active regions and their evolution, taking place on 
time scales from weeks to months in the case of the Sun (cf., 
e.g., IJenkinslEool lLanza et al.ll2003l 12007 . and references 
therein). 

Several methods have been proposed to reduce the im- 
pact of solar-like variability on transit detec tion, starting from 



simp le high-pass filters dDe fav et al. 



ters (Jenkins et al. 2002: ICarpano et al. 



2001) to optimal fil- 



2003). Those methods 



work well only for continuous, evenly sampled timeseries be- 
cause gaps or irregular sampling introduce aliasing problems. 
Moreover, optimal filters need a sufficiently long timeseries to 
estimate the power spectrum of the intrinsic stellar variability 
from the data themselves in order to construct the filter. For a 
slowly rotating star as the Sun, this implies a time series of at 
least two or three months, given that most of the variabi lity is 
on rotational timescales. Recently ! Aigrain & Irwinl (120041) have 
proposed methods to overcome the difficulties due to irregular 
sampling or gaps, based on a least square fit of individual sine 
and cosine components or on the application of iterative non- 
linear filters. 

The first CoRoT blind test ( Moutou et al.l2005 ) represented 
a testbed for several different methods to treat solar-like vari- 
ability and showed the potentiality of those methods in com- 
bination with suitable transit finding algorithms to detect tran- 
sits in a quite large set of simulated light curves including also 
gaps and instrumental effects. The best method to remove stel- 
lar microvariability proved to be a best fit based on the use of 
200 sine and cosine har monic functions cou pled with the BLS 
transit finder algorithm (IKovacs et al.l 120021) . proposed by the 
Geneva team. 

In the present paper, we propose a different method to 
treat stellar variability, based on our mo del of the Total 
Solar Irradiance (hereinafter TSI) variations (lLanza et al . 2003 



2007). Previous models do not exploit knowledge of the cause 
of the variation, that is the intrinsic evolution and the rotational 
modulation of the visibility of active regions, while our model 
includes such effects. The residuals obtained after subtracting 
our model have a maximum amplitu de of 100-150 ppm and 
an almost Gaussian distribution (cf. lLanza et al. 2003, and 
Sect. I3.lt . suggesting us to apply algorithms developed for the 
detection of planetary transits in the presence of a Gaussian 
white noise. The performance of the proposed approach will 
be compared with that of the 200-harmonic best-fit algorithm 
of the CoRoT blind test to establish advantages and limitations 
of the new method. Moreover, other methods will be briefly 
discussed in Sect. [5] deferring a detailed comparison to succes- 
sive papers. 

2. Light curve simulations 

To assess the performance of the proposed method, we shall 
analyse an extensive set of simulated light curves. Each light 
curve includes flux variations produced by Sun-like activity, 
planetary transits and photon shot noise, respectively. 



To simulate the effects of Sun-like activity, we choose a 
subset of the TSI variation of duration 170 days ranging from 
January 1 st to June 19 th 2001 as observed by the radiome- 
ters of the VIRGO experiment on board of the SoHO satel- 
lite (|ft5hlichetaD [Hil [l997l see Fig. Q). Specifically, we 
use the 2.0-level TSI time series with a time sampling of one 
value per hour, accessible from the VIRGO web site at the 
Physikalisch-Meteorologisches Observatorium Davos - World 
Radiation CenteiQ. The standard deviation of the hourly TSI 
data is 20 ppm. Short quasi-periodic gaps, not exceeding a few 
hours, are present in the TSI time series and are retained in our 
simulations as representative of the gaps expected for a space- 
borne photometric experiment. Their total duration is 36 hours, 
i.e., 0.9 per cent of the time series (see Fig. [I). 

The considered time series falls close to the maximum of 
activity cycle 23 and shows flux variations up to 2600 ppm 
due to the transits of large active regions across the solar disk. 
Therefore, it is well suited to study the impact of the largest 
Sun-like variability on planetary transit detection. Note that the 
level of activity of the Sun at the maximum of the 1 1 -yr cycle is 
close to the median activity l evel of field dw arf stars of spectral 
types F and G, as found by lSchmittl (1 19971) by comparing their 
coronal X-ray fluxes. 

We prefer to use real solar irradiance data instead of sim- 
ulating st ellar microva riability w ith the app r oache s proposed 
by, e.g., lAigrain et all d2004 or lLanza et all J2006I) . Actually, 
those models are based on extrapolations of the solar variabil- 
ity characteristics to the case of more active stars and cannot be 
tested observationally yet. 

To simulate stellar variability for intervals of 150 days, such 
as those of the long runs of the space experiment CoRoT, we 
extract 150-d sections of the above TSI time series choosing 
starting points at random between and 20 days. In such a 
way, we get different realizations of Sun-like activity reducing 
possible systematic effects associated with the variations con- 
tained in a fixed TSI subset (cf. Sect. l3.lt . 

The light curves of planetary transits are computed by 
means of the IDL procedure UTM (lDeeglll999l) . We consider 
circular planetary orbits with periods of 5, 10, 25 and 50 days, 
and an inclination of 90°, i.e., the planet transits across the cen- 
tre of the disk of the star. The radius of the planet is varied be- 
tween 1.0 and 2.0 R e in steps of 0.25 R ffi . A radius of 1 R and 
a mass of 1 M are assumed for the star and a quadratic limb- 
darkening law is adopted for the stellar photosphere. Therefore, 
the duration of the transits ranges from 3.11 hours for an orbital 
period of 5 days up to 6.67 hours for a period of 50 days. 

In such a way, a total of 20 different transit light curves has 
been simulated. Finally, we shift the initial transit to a random 
phase between 0.0 and 1.0, taken from a uniform distribution, 
and add stellar variability and photon shot noise assuming that 
Poisson statistics can be well approximated by Gaussian statis- 
tics at the photon count levels characteristic of planetary tran- 
sit searches. We consider four levels of noise, i.e., with stan- 
dard deviations of 100, 200, 300 and 1000 ppm, which approx- 
imately correspond to the noise levels reached in one hour in- 
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Fig. 1. The Total Solar Irradiance time series (level 2.0 data of the VIRGO experiment) used to simulate Sun-like microvariability 
in the present investigation. Small horizontal red dashes at the bottom of the plot mark gaps in the time series. 



tegration time for G2V stars of magnitudes V = 12, 13, 14 
and 16 as observed by CoRoT, respectively One hundred in- 
dependent white Gaussian noise realizations are computed for 
each value of the standard deviation and added to each of the 
noiseless light curves giving a total of 8,000 light curves with 
transits. 

In a search for planetary transits it is important to estimate 
the number of false alarms produced by those random combi- 
nations of the nois e that may approximate a transit feature (see 
Doyle et al.ll2.000l and Sect. |4). We determine the frequency 
of false alarms by analysing transitless light curves obtained 
by combining solar irradiance variations with 100 random se- 
quences for each of the above noise standard deviations, for a 
total of 400 light curves. 

3. Analysis of simulated light curves 

The simulated light curves are analysed in two steps. First we 
fit stellar microvariability by applying our m odel based on three 



active regions (hereinafter 3-spot model; lLanza et al.l 12003) 
and the model proposed by the Geneva team for the CoRoT 
blind test, based on a least-square fit with harmonic func - 



tions (hereinafter 200-harmonic fitting; iMoutou et al. 2005) 



Secon dly, we apply the B ox-fitting Least Square algorithm (or 



BLS; iKovacs et al.l l2002) to search for transits in the time se- 



ries of the residuals obtained with the two different variability 
models to see which one gives the best performance in terms 
of detected transits and reduced false alarms. The residuals of 
the transitless light curves are analysed in the same way to find 
the frequency of false alarms produced by noise. 

3.1 . The three-spot model of Sun-like variability 

The model is described by Lanza et al. I d2003l). andjte perfor- 
mance is studied in detail by lLanza et al.l ( 2003 , 2007 ) in the 
case of the Sun at different phases of the 1 1-yr cycle. Therefore, 
we limit ourselves to a brief introduction. The model makes use 
of three active regions, containing both cool spots and warm 



faculae, to fit the rotational modulation of the TSI, and a uni- 
formly distributed background component to fit the variation of 
its mean level along the 1 1-yr cycle. The ratio of the area of the 
faculae to that of the sunspots in each active region is fixed. 

The rotation period P m is assumed as a free parameter 
ranging from 23.0 to 33.5 days, and its value is determined, 
toget her with the other free parameters, by minimizing the^f 2 
(see lLanza et al. 2003). The inclination of the rotation axis i 



http://corotsol.obspm.fr/web-instrum/payload.param/index.html 



is set at the solar value during the fitting process. Considering 
also the mean flux level, the areas and the co-ordinates of the 
three active regions, the number of free parameters is eleven. 

The timescale of evolution of the active region pattern on 
the Sun is significantly shorter than the rotation period. The 
longest time interval that c an be modelled w ith three stable ac- 
tive regions is 14 days (cf. lLanza et al. 2003 ). Hence, the time 
series is subdivided into subsets of 14 days, each of which is 
fitted with the model described above. The duration of a transit 
is at least one order of magnitude shorter than the timescale of 
variation of the active region configuration, ensuring that the 
model is not significantly affected by transit dips. Small dis- 
continuities at the level of ~ 30-50 ppm are sometimes present 
at the matching points between successive 14-d best fits. Their 
impact on our results is minimized by the random choice of the 
origin of the TSI time series used to simulate stellar activity, 
ensuring that those features never fall at the same points of the 
solar variation series. 

In Fig. [2] we plot with solid lines the distributions of the 
residuals of the best fit to the TSI from 6 February 1996 to 6 
February 2007, distinguishing three time intervals correspond- 
ing to the minimum, intermediate and maximum levels of ac- 
tivity along solar cycle 23, respectively. The distributions are 
well approximated by Gaussians except in the tails that ap- 
pear slightly higher than a normal distribution. Power spectra 
of the residuals are plotted in Fig. [3] (left panel) revealing a 
characteristic i nverse dependence on frequency up to 0.25 hr _1 
(Jenkins 2002) and an almost white spectrum at higher frequen- 
cies. Note the high level of power during the intermediate phase 
of cycle 23 owing to the appearance of large sunspot groups in 
the decaying phase of the cycle. 
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Fig. 2. Distributions of the residuals of the TSI best fit along 
solar cycle 23 obtained with the 3-spot model (solid black 
histograms) and the 200-harmonic fitting (dashed green his- 
tograms) for the three labelled time intervals corresponding 
to the minimum (top panel), intermediate (middle panel) and 
maximum (bottom panel) of activity. Solid black lines and 
green dashed lines are Gaussian best fits to the 3-spot and the 
200-harmonic residual distributions, respectively. The standard 
deviations of the Gaussian best fits are indicated in the panels 
with cr 3s and cr h referring to the 3-spot model and the 200- 
harmonic fitting, respectively. 

3.2. Modelling Sun-like variability by 200-harmonic 
fitting 

Stellar variability is modelled as a sum of harmonics of a fun- 
damental frequency /l = 57, where T is the whole duration of 
each time series, i.e., ~ 150 days. The number of considered 
harmonics is fixed at Nl = 200, so the highest frequency in the 
model corresponds to a period of 1.5 days, well below the fre- 
quency range where most of the transit power is concentrated. 
Therefore, the fitting technique is not expected to greatly affect 
the transit signal. The model flux at time f,- is given by: 



,\<L 



,\<L 



Fiin) = a + Y^ c k cos(2nkf L tj) + ^d k sm(2nkf L ti). 



(1) 



k=l 



k=\ 



Including the average level a,Q, the model has 401 free param- 
eters that can be determined by a least-square fit. We applied 



Singular Value Decomposition (hereinafter SVD) to solve the 
system of n ormal equat i ons for this least-square problem, as 
described by lPress et al. Ul992h . 

The distributions of the residuals are well approximated by 
Gaussians except in the tails that appear slightly higher than a 
normal distribution (see Fig. [2] dashed green plots). Their stan- 
dard deviations are smaller than those of the residuals obtained 
with the 3-spot model. Power spectra of the residuals are plot- 
ted in Fig.[3](right panel) and are similar to those obtained with 
the 3-spot model, except for the cutoff at a frequency corre- 
sponding to the period of 1.5 days. 

3.3. Searching for planetary transits 

The time series of the residuals obtained by the above fitting 
approaches are analysed to detect transits by means of the BLS 
algorithm. It assumes that transits are strictly periodic with a 
period P and that the noiseless time series assumes only two 
values, i.e., H outside of transits and L inside transits. In other 
words, each transit is assumed to have a simple box-shaped 
profile, that is enough for detection purposes. The time spent in 
transit at the level L is qP, where the fractionary transit length 
q is a small number q « 0.001 - 0.1. The time of the first 
transit in the time series is indicated by ?o. In the presence of 
a white Gaussian noise with a constant standard deviation <x, 
assuming that the average level of the time series is zero, the 
algorithm aims to find the best model by an estimation of the 
four free parameters P, q, 6 = H - L, and to that minimize 
the x 2 between the model and the data. We refer the reader to 
Kovacs et al.1 (2002) for further details. Here we note only that a 
useful measure of the signal-to-noise ratio of a transit detection 
is given by: 



a = — -Jnq, 
cr 



(2) 



where n is the number of points in the whole time series (i.e., 
n ^ 3600 in our 150-day time series with one-hour sampling), 
and cr/ yjnq is the standard deviation of the average of all mea- 
surements during transits. According to lKovacs et al.1 (120021) . a 
transit detection can be considered significant at the 3-cr level 
against false alarms if a k, 6. In the case of our tests, we rede- 
termine appropriate a thresholds (see Sect. |4j. 

The implementation of the BLS algorithm is per- 
formed by means of the Fortran subroutine accessi- 
ble at http://www.konkoly.hu/staff/kovacs.html We 
slightly modified it to avoid negative values of a that would 
correspond to periodic positive spikes, retaining only best fits 
with a positive value of a that correspond to transit dips. The 
parameter space is searched for transits with a period P be- 
tween 1 and 150 days, with 30,000 values of the frequency uni- 
formly spaced between and 1 day -1 ; the fractionary length 
of the transit q is varied between 0.002 and 0.08, assuming a 
number of 700 phase bins in the light curve folded at each trial 
period. Such a sampling of the parameter space ensures that 
none of the simulated transits will get lost because of an in- 
appropriate choice of the free parameters in our search (for an 
optimal choice of the free parameter steps and ranges see , e.g., 
Kovacs et al]|2002t iDovle et alfeOOOtUenkins et alfeooll) . 
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Fig. 3. Power spectra of the residuals of the TSI best fit obtained with the 3-spot model (left panel) and the 200-harmonic fitting 
(right panel). Different linestyles and colors indicate different levels of activity: black, solid line: maximum; green, dashed line: 
intermediate; and, red, dotted line: minimum. 



The large computational load of our experiment is managed 
by running our analyses on a large grid infrastructure consisting 
of three high-performance computing nodes located in Catania, 
Palermo and Messina. The total number of CPUs is 560, each 
one being an AMD Opteron dual- core 64-bit processor with 1 
or 2 GB of RAM jBeccianill2007l) . The CPU time for fitting 
solar variability and performing transit search is on the average 
~ 1 1 minutes per light curve with the 3-spot method and ~ 4 
minutes per light curve with the 200-harmonic method, while 
about 2.5 days of elapsed time have been necessary to analyse 
our complete set of 8,000 light curves. 



4. Results 

A typical sample of our results is given in Fig. HI where the 
distributions of the values of a are plotted for simulations with 
a planet of 1 .5 R ffi , the labelled values of the orbital period P, 
and a standard deviation of the noise cr = 200 ppm. The ver- 
tical dash-dotted lines indicate the false-alarm thresholds for 
the 200-harmonic fitting a t = 6.3 and for the 3-spot model 
a t - 6.5, respectively. They are derived from the analysis of the 
transitless light curves under the requisite that the frequency of 
false alarms be < 0.01 (i.e, not exceeding one in a hundred). 

The histograms in Fig. @] show the distributions of a for 
the cases where the orbital period of the planet was correctly 
identified or incorrectly identified, respectively. For a correct 
identification of the period, we require that the orbital period 
given by the BLS routine is within +0.1 days from the true 
period. When the period is correctly identified and a > a t , 
we count the detection as a positive one. Note that very few 
cases with a > a t and P outside of the acceptable range are 
visible in the panels of Fig. [4] These are not counted as positive 
detections. 



The frequencies of positive detections obtained for different 
values of planetary radius, orbital period and noise level are re- 
ported in TableQ] omitting records for values of cr giving no de- 
tections, except for the cases with the smallest orbital periods. 
In the first column we list the planetary radius R, in the sec- 
ond the standard deviation of the Gaussian photon shot noise 
cr, in the third the orbital period P, in the fourth the frequency 
of positive detections with the 200-harmonic method Fh and 
in the fifth the frequency of positive detections with the 3-spot 
method F^ s , respectively. We note that for a noise standard de- 
viation of 100 ppm, the 200-harmonic method is better than the 
3-spot method, especially for planetary radii R < 1.5 R e . On 
the other hand, for higher noise levels (cr > 200 ppm), the 3- 
spot method gives a better performance than the 200-harmonic 
method, especially for planetary radii R > 1.5 R ffi . 

As an illustrative application of our results, let us consider, 
e.g., a G2V star of V = 12 observed in the CoRoT white-light 
exochannel with an integration time of one hour. In this case, 
cr ^ 100 ppm and planets with radii as small as 1 .25 R e and or- 
bital period P < 10 days should be detectable in more than 95 
per cent of the cases if stellar variability and photon shot noise 
are the only effects limiting photometric precision. However, 
a proper assessment of the performance of a giv en space in 



strument should consider also other effects (see iBorde et al 



2003; M outou et al.ll2005l) . which goes beyond the purpose of 
the present study aimed at comparing different stellar activity 
fitting methods. 

5. Discussion 

The results presented above prompted us to investigate the 
cause of the different performance of the two methods at dif- 
ferent levels of noise and planetary radii. We found that the 
200-harmonic method, having 401 degrees of freedom to fit 
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Table 1. Fraction of positive detections obtained in our experi- 
ment. 
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Fig. 4. Distributions of the values of a as defined by Eq.© 
obtained by analysing light curves with transits of a planet of 
1.5 Re. Each set is characterized by a different orbital period 
of the planet (as labelled). The standard deviation of the white 
Gaussian noise is in all cases 200 ppm. The red and blue ver- 
tical dot-dashed lines indicate the 1 per cent threshold for the 
200-harmonic fitting and the 3-spot model, respectively. Red 
solid histograms show the statistics of light curves where the 
period was correctly identified after applying the 200-harmonic 
method, blue solid histograms those after applying the 3-spot 
method, respectively. Dashed histograms refer to the statistics 
of light curves where the period was incorrectly identified, with 
the same meaning of the colors. 



the entire 150-d sequence, is better suited to reproduce the flux 
perturbations due to magnetic activity and noise than the 3-spot 
method, which has only 1 1 free parameters for each 14-d best 
fit (cf. Figs. |2]and[3]l. However, the use of a set of orthogo- 
nal functions to fit stellar variability produces a significant dis- 
tortion of the planetary transits in the residual light curves. In 
particular, the depth of the transits is reduced, as it is clear by 
considering the best fits obtained in the case of a light curve 
containing only transits (cf. Fig. [5j. This i s a consequence of 



the w ell-known Gibbs' phenomenon (e.g., iMorse & Fesh bach 
1954 ). Since the transit dip is mathematically similar to a cou- 
ple of close discontinuities of the first kind, the fitting function, 
which is a truncated series of orthogonal trigonometric func- 
tions, overshoots the flux variation at the ingress and at the 
bottom of the transit leading to a reduction of its depth 6 in 
the residual curve. Note that the problem is not alleviated by 
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increasing the order Nl of the fitting functions, because it is 
a consequence of t he orthogonal nature of th e adopted set of 
fitting functions (cf. lMorse & Feshbachlll954l) . When the stan- 
dard deviation of the noise is small (<x = 100 ppm), the reduc- 
tion of 5 does not greatly affect the detection efficiency because 
a, although reduced (cf. Eq. 12), is still above the significance 
threshold. Hence, the 200-harmonic method performs better 
than the 3-spot method since it fits better the light variations. 
On the other hand, when the noise level increases (<x > 200 
ppm), the reduction of 6 has a larger impact on the a-statistics 
as more and more values of a fall below the threshold making 
the performance of the 200-harmonic method worse than that 
of the 3-spot method (see Fig.|6]l. Note that the 3-spot method 
is based on the use of only three fitting functions that are not 
orthogonal, thus it is not affected by the Gibbs' phenomenon. 
Moreover, the reduction of the transit depth in the residual light 
curve is very small because the fitting functions mainly follow 
the points outside transits, especially when the transit duration 
is short and the orbital period is long with respect to the adopted 
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Fig. 5. Residuals obtained with the 200-harmonic method (red 
dotted line) and the 3-spot method (blue dashed line) for a 
noiseless light curve containing only transits; the black solid 
line is the light variation during the transit. Note the Gibbs' 
phenomenon affecting the residuals obtained with the 200- 
harmonic fit and the smaller distortion of the transit profile ob- 
tained with the 3-spot method. 



14-d time interval during which the parameters controlling the 
shape of the fitting functions are kept fixed. 

The Gibbs' phenomenon affects also other filtering meth- 
ods proposed to reduce the impact o f stella r microvariabil- 
ity, such as, e.g., that of Carpano et alj (120031) . or the Wiener- 
like discrete filters by Ai grai n & Irwinl (120041) . Wavelet-based 
methods (e.g., Jenkins! l2002h may be useful to overcome 
the problems related to Gibbs' phenomenon, given the non- 
orthogonal nature of their basis functions, but they are nega- 
tively affected by gaps in the time serie s. On the other hand, the 
non-linear iterative filter proposed by lAigrain & Irwinl ([2004) 
is piratically insensitive to gaps or irregular sampling and does 
not make use of orthogonal functions. Therefore, it is worth a 
detailed comparison with our method and this will be the sub- 
ject of a forthcoming work. 

Note th at the 3-spot method per forms well in the presence 
of gaps (see Lanza et alj2003 , 2007 ) because the model is natu- 
rally interpolated through them to follow the observations. The 
main limitations of the 3-spot model are the computing time 
and the necessity of adjusting some of the parameters if the 
star is not analogous to the Sun. The computation time is re- 
ally not a problem when a parallel high-performance comput- 
ing system is available, as we have demonstrated through the 
extensive use of a grid-based system to perform our calcula- 
tions. As a matter of fact, the problem of analysing the large 
number of light curves coming from a space experiment is a 
fully-parallelizable problem and the elapsed time scales with 
the inverse of the number of available CPUs. 













^■.•.Y r .. 







Fig. 6. Signal-to-noise ratio obtained with the 200-harmonic 
fitting versus the corresponding value aj s obtained with the 3- 
spot model for light curves with R - 1.5 R e , P < 25 d and 
standard deviation of the noise of 200 ppm. The dot-dashed red 
and blue lines indicate the a thresholds for the 200-harmonic 
fitting and the 3-spot model, respectively. The yellow region of 
the plot indicates transits detected with the 3-spot model that 
go undetected with the 200-harmonic fitting. 

The problem of adjusting model parameters to analyse light 
cu rves of stars not stricly analogou s to the Sun has been treated 
bv lLanza et al. (2003, 2004, 2007) and will not be reconsidered 
here, since we intend to discuss it systematically in a successive 
paper. 

6. Conclusions 

We have performed extensive numerical experiments to com- 
pare two different methods of fitting Sun-like variability for the 
detection of Earth-like planetary transits. Our results show that 
the 3-spot model performs better than the 200-harmonic fitting 
when the standard deviation of the photon shot noise is 2 - 4 
times greater than the central depth of the transit. Conversely, 
the 200-harmonic fitting has a better performance when the 
noise standard deviation is comparable with the depth of the 
transit. This behaviour has been interpreted as a consequence 
of the Gibbs' phenomenon and is expected to be common to all 
filtering and fitting methods based on sets of orthogonal func- 
tions. On the other hand, the assumption of some general and 
simple physical hypotheses about the origin of Sun-like vari- 
ability allows the 3-spot method to reach a truly appealing per- 
formance, in spite of the fact that it has only eleven degrees of 
freedom. 

Therefore, it is worth to conduct further detailed compar- 
isons, in partic ular with non-l i near i terative filters, such as that 
proposed by lAigrain & Irwin (2004). We plan to perform such 
tests by simultaneously applying different methods to the same, 
large set of simulated light curves, thus extending the Monte 
Carlo approach of the present paper. 
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